Speech and Audio Coding for Multimedia Communications

Author

  • Peter Noll
Abstract

We have seen rapid progress in high-quality compression of telephone speech and wideband speech signals. Linear prediction, subband coding, transform coding, and various forms of vector quantization and entropy coding have been used to design efficient coding algorithms that achieve substantially more compression than was thought possible only a few years ago. In audio coding, with its bandwidth of 20 kHz and more, the concept of perceptual coding has paved the way for significant bit rate reductions. The paper explains basic approaches to such compression, concentrating on existing and upcoming international standards. As typical signal classes we consider telephone speech, wideband speech, and wideband audio signals, all of which differ in the quality listeners expect. The main motivations for low bit rate coding are outlined, as well as basic and network-related requirements. It will become obvious that speech and audio coders must be both source-specific and hearing-specific to perform adequately at low bit rates.


Similar resources

Digital Audio for Multimedia

The paper covers key technologies in wideband audio coding including auditory masking, perceptual coding, frequency domain coding, and dynamic bit allocation. The MPEG standardization work is then described. MPEG algorithms have found a wide range of communications-based and storage-based applications. For example, the European digital audio broadcast (DAB) makes use of MPEG-1. It will then be...


Cipher text only attack on speech time scrambling systems using correction of audio spectrogram

Recently permutation multimedia ciphers were broken in a chosen-plaintext scenario. That attack models a very resourceful adversary which may not always be the case. To show insecurity of these ciphers, we present a cipher-text only attack on speech permutation ciphers. We show inherent redundancies of speech can pave the path for a successful cipher-text only attack. To that end, regularities ...


Perceptual Coding of Narrowband Audio Signals

New applications such as Internet broadcast and communications, consumer multimedia products, digital AM broadcast and satellite networks are emerging. Those applications require moderate audio quality without annoying artifacts at bit rates below 16 kbit/s. Although speech coders provide high speech quality at bit rates around 8 kbit/s, they perform poorly when encoding audio signals. In this ...


Perceptual Coding of Narrowband Audio

New applications such as Internet broadcast and communications, consumer multimedia products, digital AM broadcast and satellite networks are emerging. Those applications require moderate audio quality without annoying artifacts at bit rates below 16 kbit/s. Although speech coders provide high speech quality at bit rates around 8 kbit/s, they perform poorly when encoding audio signals. In this...


[The power of speech].

fects. In effect, the mission would be a topographic imager that would yield a water map of volumetric gain or loss after each overpass (14). Such a satellite mission would enable hydrologists to move beyond the point-based gauging methods of the past century to measurements of the spatial variability inherent in surface water hydrology. Global coverage would ensure that, despite local economic...



Publication date: 1999